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3 <110> APPLICANT: URQUIMA, S.A. 

5 <120> TITLE OF INVENTION: Promotor and constructions for expression of 

6 recombinant proteins in filamentous fungi 
8 <130> FILE REFERENCE: Thaumatin 2 . 

C--> 10 <140> CURRENT APPLICATION NUMBER: US/09/647 , 543 
C--> 11 <141> CURRENT FILING DATE: 2000-10-02 

13 <150> PRIOR APPLICATION NUMBER: ES 9800699 

14 <151> PRIOR FILING DATE: 1998-04-02 
16 <160> NUMBER OF SEQ ID NOS : 13 

18 <170> SOFTWARE: Patentin Ver. 2.1 

20 <210> SEQ ID NO: 1 

21 <211> LENGTH: 2570 

22 <212> TYPE: DNA 

23 <213> ORGANISM: Aspergillus awamorii 

25 <220> FEATURE: 

26 <221> NAME/KEY: intron 

27 <222> LOCATION: (785),. (850) 

2 9 <220> FEATURE: 

3 0 <221> NAME/KEY: intron 

31 <222> LOCATION: ( 1414 )..( 1471 ) 

33 <220> FEATURE: 

34 <221> NAME/KEY: CDS 

35 <222> LOCATION: (741) .. (785) 

37 <220> FEATURE: 

38 <221> NAME/KEY: CDS 

39 <222> LOCATION: (850 ).. (1413 ) 

41 <220> FEATURE: 

42 <221> NAME/KEY: CDS 

43 <222> LOCATION: ( 1472 ).. (2242 ) 

45 <400> SEQUENCE: 1 

46 tctagattgc gacggcgtat tgcttatcct tagtaggact ccctaatgga ttccgagcaa 60 
48 gaaaagactg tttggcgtgt accaatggct catagtacca gcaagagaag aattttctct 12 0 
50 ctcgcttcga gaaagcaatc aaaaaaaaat cctatcctac cctaccctac cctaatactt 180 
52 ccattgccac ccgattcctc ccgatagtag agcgggcgac tgccatttgg cgggcgggcc 240 
54 agcggattcc cgccgataga taacgggcag attctgtgac ctcaaactat cgactaacag 300 
56 cccgaacttc ggcggccacc gccaaacccg ccccggaagc cggcctcatt tgccgtttgg 360 
58 gcgtgccagg aaatgccgcc tgcagcggag actccctagt gtggtctgtg ttgcctgtgt 42 0 
60 cgtctgtgta gtatactagt tactagtcta ctactgtaca gtggatggcc tgaggggggg 480 
62 actttatgtc cgactccggc tgttctcctc cctctatcca ctctaccctc ttccctctct 540 
64 tctgtctttc tccccgctct cgcccctccc ctcctcgaaa acataaatcg gcctttcccc 600 
66 ctcgccatct tcttcttctt ctccctctcc tttctctttc ttcttcagac tacttctctt 660 
68 tctttcatct tttctctata ttcctgtttt cctagatacc ccagttaaaa aagttctctc 720 
70 aatcaatcct ccccttcaga atg tct aac ctt cct cac gag ccc gag ttc gag 773 
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71 Met Ser Asn Leu Pro His Glu Pro Glu Phe Glu 

72 1 5 10 

74 cag gcc tac aag ggtatgttcc attgcccctc cgaaattgat gatggaaaaa 825 

75 Gin Ala Tyr Lys 

76 15 

78 aaattctaac aacatcctct taca gag ctt gcc teg acc ctt gag aac tec 876 

79 Glu Leu Ala Ser Thr Leu Glu Asn Ser 

80 20 

82 acc etc ttc cag aag aac cec gaa tac cgc aag gee ctt get gtc gte 924 

83 Thr Leu Phe Gin Lys Asn Pro Glu Tyr Arg Lys Ala Leu Ala Val Val 

84 25 30 35 40 

86 tec gtc cec gag cgt gtc ate cag ttc cgt gtc gtc tgg gag gat gat 972 

87 Ser Val Pro Glu Arg Val lie Gin Phe Arg Val Val Trp Glu Asp Asp 

88 45 50 55 

90 gcc ggc aac gtc cag gtc aac cgc ggt ttc cgt gtc cag ttc aac age 1020 

91 Ala Gly Asn Val Gin Val Asn Arg Gly Phe Arg Val Gin Phe Asn Ser 

92 60 65 70 

94 gcc etc ggt cec tac aag ggt ggt ctt cgt ttc cac cec tec gtc aac 1068 

95 Ala Leu Gly Pro Tyr Lys Gly Gly Leu Arg Phe His Pro Ser Val Asn 

96 75 80 85 

98 ttg tec ate etc aag ttc ctt ggt ttc gag cag ate ttc aag aat get 1116 

99 Leu Ser lie Leu Lys Phe Leu Gly Phe Glu Gin lie Phe Lys Asn Ala 

100 90 95 100 

102 etc act ggc ctg aac atg ggt ggt ggt aag ggt ggt tec gae ttc gac 1164 

103 Leu Thr Gly Leu Asn Met Gly Gly Gly Lys Gly Gly Ser Asp Phe Asp 

104 105 110 115 120 

106 cec aag ggc aag tee gae aac gag ate cgt cgc ttc tgt gtt tec ttc 1212 

107 Pro Lys Gly Lys Ser Asp Asn Glu lie Arg Arg Phe Cys Val Ser Phe 

108 125 130 135 

110 atg acc gag etc tgc aag cac ate ggt gee gae act gat gtt ecc get 1260 

111 Met Thr Glu Leu Cys Lys His lie Gly Ala Asp Thr Asp Val Pro Ala 

112 140 145 150 

114 ggt gae ate ggt gtc acc ggt cgt gag gtc ggt ttc etc ttc ggc cag 1308 

115 Gly Asp He Gly Val Thr Gly Arg Glu Val Gly Phe Leu Phe Gly Gin 

116 155 160 165 

118 tac cgc aag ate cgc aac cag tgg gag ggt gtt etc ace ggt aag ggt 1356 

119 Tyr Arg Lys He Arg Asn Gin Trp Glu Gly Val Leu Thr Gly Lys Gly 

120 170 175 180 

123 ggc age tgg ggt ggt tec etc ate cgc cet gag gcc acc ggt tac ggt 1404 

124 Gly Ser Trp Gly Gly Ser Leu He Arg Pro Glu Ala Thr Gly Tyr Gly 

125 185 190 195 200 

12 7 gtt gtc tac gtatgtcaat tectcttctt atgattatet atgtataaca 1453 
128 Val Val Tyr 

130 gcgactaaeg egtaaeag tac gtc gag cac atg att get cac gcc acc aac 1504 

131 Tyr Val Glu His Met He Ala His Ala Thr Asn 

132 205 210 

134 ggc cag gag tec ttc aag ggc aag cgc gtt gee ate tec ggt tec ggt 1552 

135 Gly Gin Glu Ser Phe Lys Gly Lys Arg Val Ala He Ser Gly Ser Gly 

136 215 220 225 230 
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138 


aac 


gtt 


gcc 


cag 


tac 


gcc 


gcc 


etc 


aag 


gtc 


att 


gag 


etc 


ggc 


ggt 


tec 


1600 


139 


Asn 


Val 


Ala 


Gin 


Tyr 


Ala 


Ala 


Leu 


Lys 


Val 


He 


Glu 


Leu 


Gly 


Gly 


Ser 




140 










235 










240 










245 






142 


gtc 


gtc 


tec 


ctg 


age 


gac 


aeg 


cag 


ggc 


tee 


etc 


ate 


ate 


aac 


ggc 


gag 


1648 


143 


Val 


Val 


Ser 


Leu 


Ser 


Asp 


Thr 


Gin 


Gly 


Ser 


Leu 


He 


He 


Asn 


Gly Glu 




144 








250 










255 










260 








146 


ggt 


age 


ttc 


acc 


cec 


gag 


gag 


ate 


gag 


etc 


ate 


get 


cag 


ace 


aag 


gtc 


1696 


147 


Gly 


Ser 


Phe 


Thr 


Pro 


Glu 


Glu 


He 


Glu 


Leu 


He 


Ala 


Gin 


Thr 


Lys 


Val 




148 






265 










270 










275 










150 


9^9 


cgc 


aac 


gag 


etc 


gee 


age 


ate 


gtc 


ggt 


get 


get 


cec 


ttc 


age 


gac 


1744 


151 


Glu 


Arg 


Asn 


Glu 


Leu 


Ala 


Ser 


He 


Val 


Gly 


Ala 


Ala 


Pro 


Phe 


Ser Asp 




152 




280 










285 










290 












154 


gcc 


aac 


aag 


ttc 


aag 


tac 


att 


get 


ggt 


gee 


cgc 


cec 


tgg 


gtt 


cac 


gtc 


1792 


155 


Ala 


Asn 


Lys 


Phe 


Lys 


Tyr 


He 


Ala 


Gly 


Ala 


Arg 


Pro 


Trp 


Val 


His 


Val 




156 


295 










300 










305 










310 




158 


ggc 


aag 


gtc 


gac 


gtc 


get 


etc 


cec 


tec 


get 


ace 


cag 


aac 


gaa 


gtt 


tec 


1840 


159 


Gly 


Lys 


Val 


Asp 


Val 


Ala 


Leu 


Pro 


Ser 


Ala 


Thr 


Gin 


Asn 


Glu 


Val 


Ser 




160 










315 










320 










325 






162 


ggc 


gag 


gag 


gcc 


cag 


gtc 


etc 


ate 


aac 


get 


ggc 


tge 


aag 


ttc 


ate 


gcc 


1888 


163 


Gly 


Glu 


Glu 


Ala 


Gin 


Val 


Leu 


He 


Asn 


Ala 


Gly 


Cys 


Lys 


Phe 


He 


Ala 




164 








330 










335 










340 








166 


gag 


ggt 


tec 


aac 


atg 


ggt 


tge 


ace 


cag 


gag 


gee 


ate 


gac 


ace 


ttc 


gag 


1936 


167 


Glu 


Gly 


Ser 


Asn 


Met 


Gly 


Cys 


Thr 


Gin 


Glu 


Ala 


He 


Asp 


Thr 


Phe 


Glu 




168 






345 










350 










355 










170 


gcc 


cac 


cgt 


ace 


gee 


aac 


get 


ggc 


gcg 


get 


gcc 


ate 


tgg 


tac 


gee 


ecc 


1984 


171 


Ala 


His 


Arq 


Thr 


Ala 


Asn 


Ala 


Gly 


Ala 


Ala 


Ala 


He 


Trp 


Tyr 


Ala 


Pro 




172 




360 










365 










370 












174 


ggt 


aag 


gcc 


gcc 


aac 


gcc 


ggt 


ggt 


gtc 


get 


gtc 


tec 


ggt 


ctg 


gag 


atg 


2032 


175 


Gly 


Lys 


Ala 


Ala 


Asn 


Ala 


Gly 


Gly 


Val 


Ala 


Val 


Ser 


Gly 


Leu 


Glu 


Met 




176 


375 










380 










385 










390 




178 


get 


cag 


aac 


tet 


gee 


cgc 


etc 


age 


tgg 


act 


tet 


gag 


gag 


gtt 


gat 


gee 


2080 


179 


Ala 


Gin 


Asn 


Ser 


Ala 


Arg 


Leu 


Ser 


Trp 


Thr 


Ser 


Glu 


Glu 


Val 


Asp Ala 




180 










395 










400 










405 






184 


cgt 


ctt 


aag 


gac 


ate 


atg 


cgc 


gac 


tge 


ttc 


aag 


aac 


ggt 


ctt 


gag 


act 


2128 


185 


Arg 


Leu 


Lys 


Asp 


He 


Met 


Arg 


Asp 


Cys 


Phe 


Lys 


Asn 


Gly 


Leu 


Glu 


Thr 




186 








410 










415 










420 








188 


get 


cag 


gag 


tac 


gee 


acc 


cec 


get 


gag 


ggt 


gtc 


ctg 


cet 


tec 


ctg 


gtg 


2176 


189 


Ala 


Gin 


Glu 


Tyr 


Ala 


Thr 


Pro 


Ala 


Glu 


Gly 


Val 


Leu 


Pro 


Ser 


Leu 


Val 




190 






425 










430 










435 










192 


acc 


gga 


tec 


aac 


att 


gcc 


ggt 


ttc 


acc 


aag 


gtg 


get 


gee 


gee 


atg 


aag 


2224 


193 


Thr 


Gly 


Ser 


Asn 


He 


Ala 


Gly 


Phe 


Thr 


Lys 


Val 


Ala 


Ala 


Ala 


Met 


Lys 




194 




440 










445 










450 












196 


gac 


cag 


ggt 


gac 


tgg 


tgg 


taaatgegga aagccgcaaa eececgcggc 




2272 


197 


Asp 


Gin 


Gly 


Asp 


Trp 


Trp 
























198 


455 










460 
























200 


ttatgtcatg acgattatgt agtttgatgt tccctttcag 


cgcggatgga • 


tagaggegcc 


2332 


202 


ggtgttttct tgctagttta gatggatgca taatgatatc 


ettttcttaa " 


tcetcaaatt 


2392 


204 


cttgtaattt gttgtatcaa tagtagataa tacaactgta 


gteaaetaec < 


cttgcatctt 


2452 


206 


cactatttgc agatgcattc atctctattc cgagcacatg 


eacaaaccca • 


tgggaecgea 


2512 
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208 


gttcactagt acttagcctg ttatcttccc tctatcgcat 


cttaaacaac tatctaga 


211 


<210> SEQ ID NO; 


; 2 
























212 


<211> LENGTH: 460 
























213 


<212> TYPE: 


PRT 


























214 


<213> ORGANISM: 


Aspergillus 


awamorii 














216 


<400> SEQUENCE: 


2 
























217 


Met 


Ser 


Asn 


Leu 


Pro 


His 


Glu 


Pro 


Glu 


Phe 


Glu 


Gin 


Ala 


Tvr 


Lvs 


Glu 


218 


1 








5 










10 










15 




220 


Leu 


Ala 


Ser 


Thr 


Leu 


Glu 


Asn 


Ser 


Thr 


Leu 


Phe 


Gin 


Lys 


Asn 


Pro 


Glu 


221 








20 










25 










30 






223 


Tvr 


Arq 


Lvs 


Ala 


Leu 


Ala 


Val 


Val 


Ser 


Val 


Pro 


Glu 


Ara 


Val 


He 


Gin 


224 






35 










40 










45 








226 


Phe 


Ara 


Val 


Val 


Ttd 


Glu 


Asp 


Asp 


Ala 


Glv 


Asn 


Val 


Gin 


Val 


Asn 


Arcr 

AJ.y 


227 




50 










55 










60 










229 


Glv 


Phe 


Aro 


Val 


Gin 


Phe 


Asn 


Ser 


Ala 


Leu 


Glv 


Pro 


TVT" 

xyj. 


Lys 


Glv 


Glv 


^ J \j 












70 










75 










80 


232 


Leu 


Airg 


Phe 


His 




Ser 


Val 


Asn 


J-JC Li. 


Ser 


He 


Leu 


Lys 


Phe 


Leu 


Glv 


5 










85 










90 










95 




235 


Phe 


Glu 


Gin 


He 


Phe 


Lys 


Asn 


Ala 


Leu 


Thr 


Glv 


Leu 


Asn 


Met 


Glv 


Glv 


9 

^ O D 








100 










105 










110 






238 


Glv 


Lys 


Glv 


Glv 


Ser 


Asp 


Phe 


Asp 


Pro 


Lys 


Gly 


Lys 


Ser 


Asp 


Asn 


Glu 


9 Q 






115 










120 










125 








241 


lie 






Phe 


Cys 


Val 


Ser 


Phe 


Met 


Thr 


Glu 


Leu 


Cys 


Lys 


His 


He 






130 










135 










140 










245 


Glv 


Ala 


Asp 


Thr 


Asp 


Val 


Pro 


Ala 


Gly 


Asp 


He 


Glv 


Val 


Thr 


Glv 




246 


145 










150 










155 










160 


248 


Glu 


Val 


Glv 


Phe 


Leu 


Phe 


Glv 

\jj.y 


Gin 


Tvr 


Ara 


Lys 


He 


Arg 


Asn 


Gin 


X J-p 


24Q 










165 










170 










175 




251 


Glu 


Glv 


Val 


Leu 


Thr 


Glv 


Lys 


Glv 


Glv 


Ser 




Glv 


Glv 

vjxy 


Ser 


Leu 


He 


252 








180 










185 










190 






254 


Arg 


Pro 


Glu 


Ala 


Thr 


Glv 


Tvr 

A y J. 


Glv 

vjj.y 


Val 


Val 


Tvr 

xy J. 


Tvr 

xy X 


Val 


Glu 


His 


Met 


255 






195 










200 










205 








257 


He 


Ala 


His 


Ala 


Thr 


Asn 


Gly 


Gin 


Glu 


Ser 


Phe 


Lys 


Glv 


Lvs 


Arcr 


Val 


258 




210 










215 










220 










260 


Ala 


He 


Ser 


Gly 


Ser 


Gly 


Asn 


Val 


Ala 


Gin 


Tvr 


Ala 


Ala 


Leu 


Lvs 


Val 


261 


225 










230 










235 










240 


263 


He 


Glu 


Leu 


Gly 


Gly 


Ser 


Val 


Val 


Ser 


Leu 


Ser 


Asp 


Thr 


Gin 


Gly 


Ser 


264 










245 










250 










255 




266 


Leu 


He 


He 


Asn 


Gly 


Glu 


Gly 


Ser 


Phe 


Thr 


Pro 


Glu 


Glu 


He 


Glu 


Leu 


267 








260 










265 










270 






269 


He 


Ala 


Gin 


Thr 


Lys 


Val 


Glu 


Arg 


Asn 


Glu 


Leu 


Ala 


Ser 


He 


Val 


Gly 


270 






275 










280 










285 








272 


Ala 


Ala 


Pro 


Phe 


Ser 


Asp 


Ala 


Asn 


Lys 


Phe 


Lys 


Tyr 


He 


Ala 


Gly 


Ala 


273 




290 










295 










300 










275 


Arg 


Pro 


Trp 


Val 


His 


Val 


Gly 


Lys 


Val 


Asp 


Val 


Ala 


Leu 


Pro 


Ser 


Ala 


276 


305 










310 










315 










320 


278 


Thr 


Gin 


Asn 


Glu 


Val 


Ser 


Gly 


Glu 


Glu 


Ala 


Gin 


Val 


Leu 


He 


Asn 


Ala 


279 










325 










330 










335 




281 


Gly 


Cys 


Lys 


Phe 


He 


Ala 


Glu 


Gly 


Ser 


Asn 


Met 


Gly 


Cys 


Thr 


Gin 


Glu 
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282 






340 






345 










350 






284 


Ala 


He 


Asp Thr Phe 


Glu Ala 


His 


Arg 


Thr 


Ala 


Asn 


Ala 


Gly 


Ala 


Ala 


285 






355 




360 










365 








287 


Ala 


He 


Trp Tyr Ala 


Pro Gly 


Lys 


Ala 


Ala 


Asn 


Ala 


Gly 


Gly 


Val 


Ala 


288 




370 




375 










380 










290 


Val 


Ser 


Gly Leu Glu 


Met Ala 


Gin 


Asn 


Ser 


Ala 


Arg 


Leu 


Ser 


Trp 


Thr 


291 


385 






390 








395 










400 


293 


Ser 


Glu 


Glu Val Asp 


Ala Arg 


Leu 


Lys 


Asp 


He 


Met 


Arg 


Asp 


Cys 


Phe 


294 






405 








410 










415 




296 


Lys 


Asn 


Gly Leu Glu 


Thr Ala 


Gin 


Glu 


Tyr Ala 


Thr 


Pro 


Ala 


Glu Gly 


297 






420 






425 










430 






299 


Val 


Leu 


Pro Ser Leu 


Val Thr 


Gly 


Ser 


Asn 


He 


Ala 


Gly 


Phe 


Thr 


Lys 


300 






435 




440 










445 








302 


Val 


Ala 


Ala Ala Met 


Lys Asp 


Gin 


Gly 


Asp 


Trp 


Trp 










303 




450 




455 










460 











306 <210> SEQ ID NO: 3 

307 <211> LENGTH: 33 

308 <212> TYPE: DNA 

309 <213> ORGANISM: Artificial Sequence 

311 <220> FEATURE: 

312 <223> OTHER INFORMATION: Description of Artificial Sequence: 

313 Oligonucleotide ThSl 

315 <400> SEQUENCE: 3 

316 cgaatgaaaa ggaaaaggat ggccaccttc gag 33 

319 <210> SEQ ID NO: 4 

320 <211> LENGTH: 18 

321 <212> TYPE: DNA 

322 <213> ORGANISM: Artificial Sequence 

324 <220> FEATURE: 

325 <223> OTHER INFORMATION: Description of Artificial Sequence : Oligonucleotide 

326 ThS2 

328 <400> SEQUENCE: 4 

32 9 ttattaggcg gtggggca 18 

332 <210> SEQ ID NO: 5 

333 <211> LENGTH: 18 

334 <212> TYPE: DNA 

335 <213> ORGANISM: Artificial Sequence 

337 <220> FEATURE: 

338 <223> OTHER INFORMATION: Description of Artificial Sequence: 

339 Oligonucleotide lA 

341 <400> SEQUENCE: 5 

342 atgtctaacc ttcctcac 18 

345 <210> SEQ ID NO: 6 

346 <211> LENGTH: 18 

347 <212> TYPE: DNA 

348 <213> ORGANISM: Artificial Sequence 

350 <220> FEATURE: 

351 <223> OTHER INFORMATION: Description of Artificial Sequence: 

352 Oligonucleotide IB 
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